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Abstract — This paper connects multi-agent patii planning on 
graplis (roadmaps) to network flow problems, showing that 
the former can be reduced to the latter, therefore enabling 
the application of combinatorial network flow algorithms, as 
well as general linear program techniques, to multi-agent 
path planning problems on graphs. Exploiting this connection, 
we show that when the goals are permutation invariant, the 
problem always has a feasible solution path set with a longest 
finish time of no more than n + V — I steps, in which n is 
the number of agents and V is the number of vertices of the 
underlying graph. We then give a complete algorithm that finds 
such a solution in 0{nVE) time, with E being the number of 
edges of the graph. Taking a further step, we study time and 
distance optimality of the feasible solutions, show that they have 
a pairwise Pareto optimal structure, and again provide efficient 
algorithms for optimizing two of these practical objectives. 

I. Introduction 

Consider the problem illusttated in Fig. [T] which inspired 
the authors to pursue this research. As an exercise (26-1 
in [7]), the escape problem is to determine, given m < 
evaders placed on m different points of an « x « grid, whether 
there are m vertex disjoint paths from these m locations to 
m different points on the boundary of the grid. Intended as 
a demonstration of applications of maximum flow algorithms 
(Ch. 26 of [7]), it undoubtedly mimics multi-agenfl path 
planning problems on graphs. Intrigued by the elegant net- 
work flow based solution to the escape problem, we wonder: 
How tightly are these two classes of problems intertwined 
and how we may take advantage of the relationship? 





Fig. 1 . Examples of the escape problem on a 6 x 6 grid. The black discs are 
the initial evader locations. The goal is to plan disjoint paths for the evaders 
to reach different vertices on the boundary of the grid, a) An instance with 
solution given as the bold edges, b) An instance without a solution. 
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' We use agent instead of robot since the method applies to scenarios such 
as evacuation planning. 



In this paper, we explore and exploit the connection 
between multi-agent path planning on collision-free unit- 
distance graphs (or CUGs, see Section HI] for the definition) 
and network flow. We begin by showing that multi-agent path 
planning on CUGs is closely related to a class of problems 
called dynamic network flow or network flow over time. We 
then focus on the permutation invariant multi-agent path 
planning problem on CUGs (by permutation invariant, we 
mean that goals are not pre-assigned to agents. Instead, we 
only require that each goal is reached by a unique agent), 
establishing that such problems always have solutions. To 
solve the problem algorithmic ally, an adapted maximum flow 
algorithm is provided which plans collision free paths for 
all agents with worst time complexity 0{nVE), in which 
n is the number of agents, V is the number of vertices 
of the CUG and E is the number of edges of the CUG. 
Moreover, we guarantee that the last agent takes time no 
more than « + V — 1 to reach its goal, assuming that agents 
travel at unit speed. Next, we construct efficient algorithms 
for obtaining temporally and spatially optimal solutions. For 
example, our algorithm for shortest overall time has running 
time 0{nVE\ogV). We also show that these temporal and 
spatial objectives cannot be optimized simultaneously (i.e., 
they have a Pareto optimal structure). Portions of the Pareto 
optimal structure collapse as we move to the permutation- 
invariant multi-agent path planning problem on CUGs with 
goal-replacement. 

As a universal subroutine in multi-agent systems, 
colUsion-free path planning for multiple agents finds appli- 
cations in tasks spanning assembly [18], [33], evacuation [4], 
[39], formation control [2], [38], [41], [43], [45], locaUzation 
[14], object transportation [31], [40], search and rescue [21], 
and so on. Given its importance, path planning for multi- 
agent systems has remained as a subject of intense study for 
many decades. Due to the vast size of the available literature, 
we only mention a most related subset of the research in this 
field and refer the readers to [5], [26], [28] and the references 
therein for a more comprehensive review of the subject. 

When all agents are treated as a single agent with a high 
dimensional configuration space, the problem can be solved 
using cylindrical algebraic decomposition [6] or Canny's 
roadmap algorithm [3], in theory. Such coupled approaches 
suffer from the curse of dimensionality; even when sampling 
based methods [23], [27] are used, instances involving only a 
small number of agents can be computationally challenging. 
This difficulty prompts the study of methods that seek to 
explore local features whenever possible to avoid working 
with too many agents at a time. Among these, decoupled 
planning is the most popular, which generally performs coor- 



dination of robot motion after deciding a path for each robot 
[17], [22], [34], [37], [42], [49]. In contrast, priority based 
methods force an order on agents to significantly reduce 
the search space [10], [47]. Some more recent works using 
decoupling heuristics include applying optimal decoupling 
techniques to exploit problem instances with low degrees 
of coupling [48], using push-and-swap primitives to avoid 
unnecessary exploration of search space [30], and heuristics 
aimed at performance guarantees (completeness is lost) [50]. 

Our algorithmic efforts in this paper focus on the per- 
mutation invariant multi-agent path planning problem on 
CUGs. Such formulations, in both discrete and continuous 
forms, are extensively studied as formation control problems 
[2], [38], [41], [43], [45], among others. On reseai'ch that 
appears mostly related to this aspect of our paper, a discrete 
grid abstraction model for formation control was studied in 
[32]. To plan the paths, a three-step process was used in 
[32]: 1) Target assignment, 2) Path allocation, 3) Trajectory 
scheduling. Although it was shown that the process always 
terminates, no characterization of solution complexity was 
offered. In contrast, we provide very efficient algorithms 
that solve a strictly more general class of problems with 
optimality assurance. On the continuous side, a novel for- 
mation space approach was employed to represent the entire 
formation of robot teams with a single polynomial of which 
the roots correspond to the unassigned configurations for the 
robots in the formation [24]. 

We delay the literature review on network flow, from 
which we devise our time expansion construction for multi- 
agent path planning, to Section HII] The basic idea of apply- 
ing time expansion to robotics problem is far from new [10], 
[36]. To the best of our knowledge, however, the research 
presented here is an original attempt at proposing a general 
time expansion technique, connecting it to network flow, and 
making full use of the benefits that come with this approach. 
We also note that our exact and complete algorithms all 
come with low constants in their respective worst case time 
complexity because they are derived from well studied fully 
combinatorial algorithm^ Our simulation result, which we 
omit due to the length limit, confirms this assertion. 

There are three main contributions. First, we formally 
establish the link between multi-agent path planning on 
graphs and network flow, showing how multi-agent path 
planning can be reduced to network flow problems, thereby 
enabling the potential application of powerful tools from 
combinatorial optimization to path planning for multiple 
agents in a principled way. Second, for the planning problem 
in which agents do not have pre-specified goals, we give 
fast and complete algorithms for finding collision free path 
sets that deliver every agent to a different goal. Third, we 
study time and distance optimality of the feasible solutions 
to the aforementioned problem, show that they have a 
pairwise Pareto optimal structure, and again provide efficient 
algorithms for optimizing two of these practical objectives. 

^ A fully combinatorial algorithm is an algorithm that only adds, subtracts, 
and compares values; no multiplication and division operations are allowed 
(i.e., ordered group operations versus oideied field operations) [16]. 



The rest of the paper is organized as follows. In Section 
lU we define three multi-agent path planning problems on 
CUGs. Section [III] starts with a quick review of network 
flow problems and then proceeds to show the reduction 
from multi-agent path planning on CUGs to network flow. 
Concentrating our efforts on the permutation invariant multi- 
agent path planning problem. Section |IV] begins with a key 
construction that allows us to tightly bound the time steps 
required for a time-expanded network to have a feasible 
solution, which in turn enables efficient algorithms. Section 
IVl takes a further step and studies solution optimality on 
three natural objectives, showing the objectives have a Pareto 
optimal structure. Section |VT] provides a quick discussion of 
the goal-replacement case, introduces a fourth objective, and 
shows that the three temporal objectives lose the partial order 
structure. We conclude in Section IVIII 

II. Multi-agent Path Planning Problems on 
Collision-free Unit-distance Graphs 

Let G = {V,E) he a connected, undirected, simple graph 
(i.e., no multi-edges), in which V = {v,} is its vertex set 
and E = {(v,-, V;)} is its edge set. Let A = {ai,. . .,a„} be a 
set of agents with initial and goal locations on G given by 
the injective maps x/ : A — > V and : A — > V, respectively. 
Note that A is essentially an index set; xj{A) and X(j{A) are 
the set of initial and goal locations, respectively. We require 
that x/(A) and xg{A) be disjoint. For convenience, we let 
n = |A| and use V,E to denote the cardinality of the sets 
V,E, respectively, since the meaning is usually clear from 
the context. Let CJ be a bijectior0 that acts on xq, a feasible 
path for a single agent a, is a map /?, : Z+ V with the 
following propertied 1- Pi{0) =X[{ai). 2. For each /, there 
exists a smallest kmin G Z+ such that pi{kmm) = {(y oxa){ai) 
for some fixed a. That is, the end point of the path pi is some 
goal vertex. 3. For any k > k,^^, pi{k) = {aox(j){ai). 4. For 
any < < ^min, (piik) , pi{k + I)) e £ or pi{k) = 1). 
Intuitively, think of the domain of the paths as discrete time 
steps. We say that two paths pi,pj are in collision if there 
exists kE Z+ such that pi{k) = pj{k) (meet) or {pi{k),pi{k-\- 
1)) = {pj{k+\),pj{k)) (head-on). If p{k) = p{k+\), the 
agent stays at vertex p{k) during the time interval 1] . 

As mentioned, in this paper, we work with a specific type 
of graph called the collision-free unit-distance graph (CUG): 
A CUG is a connected, undirected graph G satisfying the 
following: 1. Every edge is of unit length; 2. Given any two 
distinct edges (mi,vi) and [ui^vi) of GwithMi 7^M2,vi ^V2, 
two disc shapes (or spherical for 3D or more) agents of radius 
less than ■\/2/4 traveling at unit speed through these edges 
(starting simultaneously at ui,U2, respectively) will never 
collide. A radius of \/2/4 is the largest possible for two 
adjacent agents to travel along an "L" shaped path. One can 
easily verify that any graph with unit edge length and no 
acute angles between adjacent edges is a CUG. Hence, a 
connected 2D grid with holes is a CUG. Since subgraphs 

■^(J is introduced to unify the problem formulations; its use will become 
clear shortly. For now, the reader may think of it simply as the identity map. 
'*ln this paper, we let Z+ := NU {0}. 



of 2D grids are easy to draw and visualize, we generally 
use subgraphs of 2D grids when we create examples in this 
paper With the above setup, the multi-agent path planning 
on CUGs problem is defined as follows. 

Problem 1 (Multi-agent Path Planning on CUGs) Given 
a 4-tuple {G,A,xi,xq) in which G is a CUG, find a set of 
paths P = {pi,. . . ,p„} such that pi 's are feasible paths for 
respective agents a; 's with <J being the identity map and no 
two paths Pi,Pj are in collision. 

We require the graph to be a CUG so that it is suitable 
for multi-agent path planning. We formalize the rationale in 
the following lemma. 

Observation 2 Let Pi,Pj be two paths that are not in 
collision ( as a partial solution to Problem [7]). Then two disc 
shaped agents^of radius less than a/2/4, starting at the same 
time and moving along these respective paths with unit speed, 
will never collide. 

Proof. Without loss of generality, assume the two 
agents (say, a,- and a,) started moving at time f = 
along Pi and pj, respectively. Between any time 
steps k and k + \, k > 0, agents a,- and aj travels 
along edge {pi{k),pi{k +1)) and {pj{k),pj{k +1)), 
respectively. Since p, and pj are not in collision, we 
always have pi{k) ^ Pj{k), pi{k + 1) ^ pj{k + 1), and 
{pi{k),pi{k+l)) ^ {pj{k+l),pj{k)). By the definition of 
CUG, two disc shaped agents of radius less than V2/4- 
traveling on these edges at unit speed will never collide 
between time interval Since k is arbitrary, the two 

agents will never collide. □ 

Observation 12] shows that a solution to Problem [T]provides 
a path set for disc agents with radius V2/4 in A to reach 
their respective goals without a collision. It is easy to see 
that not all instances of this problem are solvable. Moreover, 
the decision version of Problem [T] (i.e., is there a solution 
that takes the agents to goals within K time steps) is NP- 
complet^ If we remove the assumption that all agents 
must reach their respective goals and allow permutation 
invariant paths (i.e., as long as each goal gets occupied by a 
unique agent in the end), Problem[T]becomes the permutation 
invariant multi-agent path planning on CUGs problem. 

Problem 3 (Permutation Invariant Multi-agent Path Plan- 
ning on CUGs) Given a 4-tuple {G,A^xj,xg) in which G is a 
CUG, find a set of paths P = {pi, . . . ,p„} such that pi 's are 
feasible paths for respective agents ai's for an arbitrary (but 
fixed) permutation O and no two paths Pi,Pj are in collision. 

'Or spherical agents with radius less than \/2/4, for dimensions higher 
than 2. 

*The lengthy proof is out of scope for the cun'ent paper For curious 
readers, the NP-hardness proof is similar to that from [44]; the NP 
membership proof, which is non trivial, leads to fast heuristics for solving 
Problem [T] 



Problem|3]models the problem in which multiple identical 
or indistinguishable agents need to be deployed for serving 
requests at different locations. This problem always has a 
solution: We simply plan and execute one path at a time and 
use more "remote" goal vertices earlier to avoid possible 
blocking of later paths; a proof of the existence of such 
a choice of paths is given in Section |IV] Going one step 
further and allowing multiple agents to reach the same goal 
(at different time steps), we get the permutation invariant 
multi-agent path planning on CUGs with goal replacement 
problem. 

Problem 4 (Permutation Invariant Multi-agent Path Plan- 
ning on CUGs with Goal Replacement) Given a 4-tuple 
{G,A,xi,xg) in which G is a CUG, find a set of paths P = 
{pi, . . . ,p„} such that Pi's are feasible paths for respective 
agents ai's for an arbitrary (but fixed) O and no two paths 
Pi,Pj are in collision except at vertices of xq{A). 

This problem provides a realistic model for scenarios such 
as evacuation of a building due to fire hazard; the set X(j{A) 
represents the exits to safety: After an agent reaches an exit, 
the agent can be removed from the system and the exit 
becomes available again shortly after Since G is connected 
and goals can be reused, a feasible solution to Problem |4] 
can be easily obtained by sending all agents to a single goal 
vertex sequentially. 

III. Multi-agent Path Planning on CUGs and 
Network Flow 

A. Network Flow 

In this subsection we give a brief review of network flow 
problems and algorithms pertinent to our problems. For 
surveys on network flow, see [1], [13]. We start with the 
classic static network flow problems. 

Static Network Flow. A network jV — {G,u,c,S) consists 
of a directed graph G^ {V,E) with u,c:E Z+ as the maps 
defining the capacities and costs on edges, respectively, and 
5 C y as the set of sources and sinks. We let S = S^U S^, 
with 5+ denoting the set of sources and denoting the set 
of sink vertices. For a vertex v eV, let 5+(v) (resp. 5^(v)) 
denote the set of edges of G going to (resp. leaving) v. A 
feasible (static) 5+,5^-flow on this network yy is a map 
f : E ^ Z+ that satisfies edge capacity constraints, 

yeeE, fie)<u{e), (1) 

the flow conservation constraints at non terminal vertices, 

WveV\S, J2 - E /(^)=0' (2) 

and the flow conservation constraints at terminal vertices, 

E( E - E /w) = 

veS+ eeS-iv) eeS+{v) 

E( E /(^)- E /(^))- 

veS~ ee5+{v) eeS-{v) 



The quantity on either side of Q is called the value of the 
flow. 

The classic (single-commodity) maximum flow problem 
asks the question that given a network ,_A^ , what is the 
maximum value of flow that can be pushed through the 
network (i.e., seeking to maximize F)l The minimum cost 
maximum flow problem further requires the flow to have 
minimum total cost among all maximum flows. That is, we 
want to find the flow among all maximum flows such that 
the quantity 

Y.<'{e)-f{e) (4) 

eeE 

is minimized. Given integer inputs, integer maximum flow 
always exists, and many polynomial time algorithms exist 
for finding such a solution [9], [15]. The minimum cost 
maximum flow problem is equivalent to the minimum cost 
circulation problem, which is also solvable in polynomial 
time [46]. 

When additional structure is put on S, additional questions 
arise. If we limit the supply (demand) of the source (sink) 
vertices, we obtain a type of the flow problem called the 
transshipment problem. To formalize this, let c/ : V — > Z be 
the supplies on the vertices of G. Given a vertex y e V, a 
positive d{v) suggests that the vertex has positive supply (v e 
5+) and a negative one suggests that the vertex has positive 
demand (v e S ). For all other vertices v, d{v) ~ 0. The basic 
version of the transshipment problem asks for a feasible flow 
through the network that also respects the supply/demand 
requirements 



eeS+{v) eeS-{v) 



(5) 



The transshipment problem becomes the evacuation problem 
when \S^\ = 1 and the demand of the single sink vertex 
is equal to the total supply of the source vertices. The 
transshipment problem and the evacuation problem, as 
special cases of the maximum flow problem, can be 
solved with maximum flow algorithms mentioned above. 
If we instead require that vertices of are paired up 

as {si,s[),. . . ,{sk,s'i^) and that commodity of type / can 
be injected only into i, and taken out at s'j, we get the 
multi-commodity flow problem. Optimality questions as 
these from the single-commodity case can be asked here as 
well. Unlike in the single commodity case, finding integer 
maximum flow for multi-commodity problems is NP-hard 
in general and MAX SNP-hard (NP-hard to approximate 
below a certain multiple of optimal flow value) even for 
some simple restrictions [8]. 

Dynamic Network Flow. If we consider that flowing com- 
modities through edges takes some time to complete, the 
problem becomes a dynamic network flow problem, which 
sometimes is also called network flow over time. There are 
two common variations of the dynamic network flow model; 



Discrete time and continuous time. In a discrete time model, 
flows enter and exit from vertices at integer time steps 
f = 0,1,..., r. For a given edge e — (m,v) e E, we may 
view the cost c(e) as the time that is required to pass an 
amount of flow (not exceeding the capacity) from the tail 
u to the head v of the edge e. Therefore, we may interpret 
a (static) flow network ,A' as a dynamic one without any 
change of notations. In the closely related continuous time 
model, which we do not use in this paper, a flow rate is 
assigned to each edge, designating how fast a unit of flow can 
pass through the edge. The constraints imposed in the static 
network flow model generally apply to dynamic network flow 
models, except that dynamic network flow further requires 
that at any time, the flow passing through any edge cannot 
exceed the edge capacity. 





(a) 



Fig. 2. a) The (static) flow network witli source and sink .s". Tlie 
numbers on the edges are the costs/time delay for passing tlirough these 
edges. We may assume that the capacities are all unit capacities, b) The 
time-expanded network with 5 copies of the original vertices (T = 4). All 
edges have unit capacity. There is a forward edge between two vertices u 
and V at time steps t and t' , respectively (e.g. j: at ^ = and y at t' = 1), if 
one of the following is true: 1. e = is an edge of the static network 
with c(e) =t' — t (the black edges, which retain the costs as c(e)'s); 2. u,v 
are the same vertex of the static network and t' — t = I (the green edges, 
which have unit costs). The green edges are also called holdover edges since 
traveling through a green edge is the same as the agent not actually moving. 

Given a dynamic flow network, a question similar to the 
single-commodity maximum flow problem is the following: 
Starting at t ~ 0, what is the maximum units of flow the 
can reach the sinks on or before time t = Tl It turns out 
that this problem can be solved using static flow algorithms 
such as Edmonds-Karp [9] over a time-expanded network. 
For example, given the dynamic flow network in Fig. [SJa), 
its time-expanded network with T = 4 is given in Fig. |2|b). 
To compute a flow over the time expanded network, we first 
add a super source and connect it using outgoing edges to 
all copies of source vertices at f = 0, and add a super sink 
and connect all copies of sink vertices for all t to it using 
outgoing edges (the super source, super sink, and additional 
edges are not shown in Fig. I2b)). 

Lemma 5 For a sufficiently large T, a flow for a dynamic 
flow network ,jV is feasible if and only if the corresponding 
static flow on the time-expanded network of ,jV is feasible. 

A proof of Lemma |5] can be found in [12]. Note that 
determining a minimally sufficient T required by Lemma 
|5] which directly affects the running time of the result- 
ing algorithm, is non-trivial. The standard maximum flow 



algorithms have time complexity depending polynomially 
on T and are therefore pseudopolynomial in general. For 
a special class of problems, the quickest transshipment 
problem, of which the goal is finding the quickest feasible 
flow for a transshipment problem over a dynamic network, 
strongly polynomial time algorithm^ exists [19]. However, 
the algorithm requires calling subroutines (for example, 
submodular function optimization routines) that are not fully 
combinatorial algorithms and also has with large constant 
terms when it comes to asymptotic time complexity. As we 
will see, our problems can be solved using polynomial time 
fully combinatorial algorithms, due to their special structures. 

B. Equivalence Between Multi-agent Path Planning on 
CUGs and Maximum Network Flow 

In this subsection, we establish a reduction from the 
problems of our interest to multi-commodity network flow. 
For illustration purpose, we use the simple graph G in 
Fig. lUa), with initial locations {s/^},' = 1,2 and goal lo- 
cations {sj},i= 1,2. An instance of Problem [T] is given by 
{G,{ai,a2] : a/ 1-> sf ,xo '■ fl; ^ s'^). To be able to apply 
maximum flow algorithms, we construct from G a time- 
expanded directed graph G', part of which is shown in Fig. 
Oc). We construct Fig. [Sjc) as follows. 




(a) (b) (c) 

Fig. 3. a) A simple CUG G. b) A gadget for splitting an undirected edge 
through time steps, c) Part of the time-expanded network (T = 2). 



Since we cannot create an infinite time-expanded network, 
we need to specify the required number of time steps. For 
now assume that this number is some sufficiently large 
T (that is, if a flow with value F is achievable with an 
arbitrarily long time expansion, then F is also achievable 
with only T time steps). After fixing T , we create 2T + 
1 copies of vertices from G, with indices 0, 1,1',.--, as 
shown in Fig. He). For each vertex v e G, we denote 
these copies v(0) = v(0)', v(l), v(l)', v(2), . . . , v(r)'. For each 
edge (m, v) G G and time steps f ,f + 1, < f < T, we then 
add the gadget shown in Fig. [3jb) between M(f)',v'(0' and 
M(f + 1), v(f + 1) (arrows from the gadget are omitted from 
Fig- EIc) since they are too small to draw). This gadget 
ensures that two agents cannot travel in opposite directions 
on an edge in the same time step. For the gadget, we assign 
unit capacity to all edges, unit cost to the horizontal middle 

'An algorithm is a strongly polynomial algorithm if: 1. The number 
of operations in the arithmetic model of computation is bounded by a 
polynomial in the number of integers in the input instance, and 2. The 
space used by the algorithm is bounded by a polynomial in the size of the 
input [16]. 



edge, and zero cost to the other four edges. To finish the 
construction of Fig. He), for each vertex v e G, we add 
one edge between every two successive copies (i.e., we add 
the edges (v(0),y(l)), (y(l), v(l)'), • • • , (v(r), v(r)')). These 
correspond to the green and blue edges in Fig. EJc). For all 
green edges, we assign them unit capacity and cost; for all 
blue edges, we assign them unit capacity and zero cost. 

The graph Fig. |3lc) is the main piece of G', which is 
mostly done with the exception of the set S. We may simply 
let S+ = {m(0) : u e {sj}) and = {v{T)' : v e {sj}). That 
is, 5+ contains the first copies of the initial locations and 
the last copies of the goal locations. The network jV' = 
{G' ,u,c,S^US^) is now complete; we have reduced Problem 
[T]to an integer maximum multi-commodity flow problem on 
with each agent from A as a single type of commodity. 

Theorem 6 Given an instance of Problem [7] with input 
parameters {G,A,xi,xg), there is a bijection between its 
solutions (with maximum number of time steps up to T) and 
the integer maximum multi-commodity flow solutions of flow 
value n on the time-expanded network jV' constructed from 
(G,A^xi,xq) with T time steps. 

Proof. 

(Injectivity) Assume that P ~ {pi, . . . ,Pn} is a solution to 
an instance of Problem [T] For each /?,■ and every time step 
t —Q,...,T, we mark the copy of piit) and (recall 
that pi{t) corresponds to a vertex of G) at time step t in 
the time-expanded graph G'. Connecting these vertices of G' 
sequentially (there is only one way to do this) yields one unit 
of flow fi on jV' (after connecting to appropriate source and 
sink vertices in 5^,5^, which is trivial). It is straightforward 
to see that if two paths pi,pj are not in collision, then the 
corresponding flows /I,// on ^/K' are vertex disjoint paths 
and therefore do not violate any flow constraint. Since any 
two paths in P are not in collision, the corresponding set of 
flows {/i , . . . ,/„} is feasible and maximal on . 

(Surjectivity) Assume that {/i,...,/„} is a integer 
maximum multi-commodity flow on the network o/K' with 
\fi\ — 1. First we establish that any pair of flows //,/; are 
vertex disjoint. To see this, we note that /, ,// (both are unit 
flows) cannot share the same source or sink vertices due 
to the unit capacity structure of J^' enforced by the blue 
edges. If fi,fj share some non-sink vertex v at time step 
f > 0, both flows then must pass through the same blue edge 
(see Fig. lUb)) with v being either the head or tail vertex, 
which is not possible. Thus, /,,.// are vertex disjoint on 
,y\^' . We can readily convert each flow /, to a corresponding 
path Pi (after deleting extra source vertex, sink vertices, 
vertices in the middle of the gadgets, and tail vertices of 
blue edges) with the guarantee that no pi^Pj will collide 
due to "meet". By construction of o/K', the gadget we used 
ensures that "head-on" collision is also not possible. The 
set {pit ■ ■ ,pn} is then a solution to Problem [T] □ 

Since integer maximum multi-commodity flow is NP-hard, 
the above construction does not directly offer an efficient 



solution to Problem [T] Nevertheless, with backtracking, it 
is not hard to design complete algorithms that search the 
time-expanded network ,_A^' for a feasible solution. Our 
preUminary analysis shows that when the problem instance 
is not particularly hard (i.e., when there are not many narrow 
passages in the combined configuration space), a solution can 
be found relatively quickly. We plan to study this problem 
in more detail in future work. Alternatively, we may readily 
obtain an integer linear programming problem from jY' and 
apply heuristics such as branch and hound method [25], 
for which many heavily optimized numerical packages are 
readily available. Moving to Problem[3] allowing an arbitrary 
permutation a to act on xq means that we may treat all 
agents as a single type of commodity. Theorem |6] gives us 
the following corollary. 

Corollary 7 Given an instance of Problem |5] with input 
parameters (G^A^Xi^xq), there is a bijection between its 
solutions (with maximum number of time steps up to T) and 
the integer maximum flow solutions of flow value n on the 
time-expanded network jY' constructed from (G^A^xi^xq) 
with T time steps. 

At this point. Problem [3] can be solved in polynomial 
time using the algorithm for the quickest transshipment 
problem with linear programming subroutines for optimizing 
submodular functions. In the next section, we show that 
we can do better by bounding the required time steps for 
finding a feasible solution to Problem |3] and then apply more 
standard combinatorial algorithms for network flow. 

IV. Efficient Combinatorial Algorithms for 
Permutation Invariant Multi-agent Path 
Planning on CUGs 

If we choose to apply combinatorial network flow algo- 
rithms over the time-expanded network to find solutions to 
Problem |3] the first priority is to determine the required 
number of time steps necessary to find a solution; otherwise 
we cannot declare that the algorithm is complete. We now 
provide a tight bound on T . Let (G,A,x/,xg) be an instance 
of Problem [3] We first prove some intermediate results 
on path sets over G. To distinguish these paths from the 
solution path set, denote them Q — {qi,. . . ,q„}. For 
convenience,/!eat/(^,), tail{qi), and len{qi) denote the start 
vertex, end vertex, and length of qi, respectively. With a 
slight abuse of notation, V {■),£{■) denote the vertex set and 
edge set of the input parameter, which can be either a path, 
qi, or a set of paths, such as Q. An intersection between 
two paths is a maximal consecutive sequence of vertices and 
edges common to the two paths. A standalone goal vertex 
is a vertex v & xq{A) such that there is a single path q E Q 
containing v. To start off, we want a path set Q with the 
following properties; 

Property 8 For all 1 <i<n, head{qi) G x/(A) and tail{qi) € 
xq{A). For any two paths qi,qj, head{qi) ^ head{qj) and 
tail{qi) ^ tail{qj). 



Property 9 Each path q, is a shortest path between head{qi) 
and tail{qi) on G. 

Property 10 The total length of the path set Q is minimal. 

Property 11 If we orient the edges of every path qt G Q 
from head{qi) to tail{qi), no two paths share a common edge 
oriented in different directions. 

Properties |8] and |9] are merely restrictions to have the initial 
and goal vertices paired up using shortest paths. Property 
[TOl requires the total length of these paths to be minimal. 
Property (TT] which is implied by Property [TO] lends to show 
that the paths can be oriented to form a directed acyclic 
graph. 

Lemma 12 There exists a set of paths Q — {^i, . . . ^q„} that 
satisfies Properties 151 1771 

Proof. 

The first property is trivial to satisfy: Given an arbitrary 
instance of Problem [3] an arbitrary pairing of vertices, one 
from xiiA) and one from xq{A), will meet the requirement. 
Since G is connected, for each pair of vertices of the 
form [s'l ,sj) with s'l G x/(A) and sj G xg(A), there is a 
path between them with finite distance. Picking an arbitrary 
shortest path qi between sf , sJ on G (there may be multiples 
of these; for example when G is a grid graph) for 1 < / < n 
satisfies the second property. The third requirement can be 
satisfied by checking all possible path sets satisfying the 
first two requirements and select one with the smallest total 
distance. 
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Fig. 4. Two opposite running paths, qi = (O^uva^ s- and qj = 
s^mfvucojs^ (black paths), have total length at least 2 longer than that 

of q'- = s^O)^wys^ and q'- = s^a^w^s^ (green paths). 

I ' ' J J J J J ^ t 

With Properties ISlfTOl satisfied, we claim that Property 
fTTlis automatic. Suppose the statement is false and assume 
that two oriented paths qi,qj run in different directions 
on some common edge (m,v). We may write the paths 
as qi = sf (O^ uv(0^ sj^ and qj = sJcOj'vuCOjsJ , in which 
COj^ is the path of q^ connecting sf to u (see Fig. |4l). 

, 0^+ , coj are interpreted similarly. Then, the paths 
q'; — sfcofcoys'^ and o' — s^cofco^sy have total length 
equaling len(qi) + len{qj) — 2, which contradicts the shortest 
total distance assumption. We conclude that no two oriented 
paths can have edges oriented in opposite directions. □ 

Above proof technique can be generalized to show that 
the oriented paths cannot form any directed cycles. 



Theorem 13 A path set Q that satisfies Properties \8 il0\ 
induces a directed acyclic graph (DAG) structure on E(Q). 

Proof. 

Since Property [TT] is implied by Properties ISlfTOl all edges 
of E{Q) have a unique orientation. Therefore, the path set Q 
induces a directed graph structure over E{Q). This implies 
that the statement of the theorem can only be false if there is 
a directed cycle in the induced directed graph. Since a single 
path from Q, being a shortest path, cannot form a directed 
cycle itself, at least two or more paths, say qi,...,qii, are 
need to form a directed cycle. Without loss of generality, 
we assume these k paths are all needed to form a cycle 
(removing any qj, 1 < i < k will leave no directed cycles 
by {qi, . . . ,qk}\qi)- That is, for each 1 < / < A:, the directed 
cycle, say C, has at least one edge that belongs only to 
qi (an illustration is given in Fig. |5]l. We show that we 
can update these paths, without violating Properties ISlfTol 
to obtain a path in the end that intersects itself, which 
contradicts Property [TO] 




Fig. 5. A hypothetical (directed) cycle in a path set. We can switch the 
heads and tails of qi.qi to get the green paths q[,q2 without changing 
the total length. Now we can remove q'^ and still have the same directed 
cycle. Peiforming the same procedure (essentially an inductive argument) 
eventually yields a path that contains a directed cycle. 

We may write qi as CO1UCO2VCO3, in which UCO2V is the 
maximal segment of qi that belongs to the cycle C; COi , 03 
may be empty. Some other path intersecting C must intersect 
u(02V at V (by the maximality of uativ) and have a segment 
belonging to C starting at vertex v; let q2 be such a path. 
Since q2 contributes some unique edges to C, there are 
some edges of q2 in C that follow v but do not belong 
to UCO2V. We can then write q2 — C04va)5WCO(,, in which w 
is the last vertex of q2 belonging to C. Note that UCO2V 
and CO4V may have edges that overlap. We can rearrange 
qi,q2 into q\ — (OiUCOivCOswCOf, and ~ (04V(0^. Clearly, 
len{q\) + len{q2) = len{q\) + len{q'2); that is, the new path 
set again satisfies Properties ISlfTOl We have shown that a 
path set {qi,. . -jqk} with a directed cycle can be rearranged 
to yield a path set such that {q\,qT,, . . . ,qi^} again contains 
the same directed cycle. Applying the same reasoning 
recursively shows that we can obtain a shortest path that 
intersects itself, which is not possible. □ 

Theorem \T3\ implies the following. 

Corollary 14 A path set Q that satisfies Properties IMTOl has 
a standalone goal vertex. 



Proof. 

By Theorem [T3] the paths in Q induces a directed 
acyclic graph on E{Q). This implies that at least one vertex 
from xg{A) must be a standalone goal vertex; otherwise 
every goal must be on another path and the directed path 
containing the goals must close to form a directed cycle 
because there are only a finite number of goals. □ 

The existence of a standalone goal vertex allows the 
construction of a path set which decomposes into paths 
that can be sequentially scheduled without colliding into 
each other We characterize such a path set as one with an 
additional property. 

Lemma 15 There exists a path set Q satisfying Properties 
\8il 1\ and the following additional property: 

Property 16 Let Qi := {qi, . . . ,qn}. For any 1 < i < n, 

restricting to Qi, among all possible paths connecting an 
initial location ( of Qi) to a standalone goal location ( of Qi) 
using oriented edges from E{Qi), qi is one shortest such. 

Proof. 

We begin with a Q = {qi, . . . ,qn} satisfying Properties 
[SifToland construct a Q' ^ Wi^- satisfying the desired 

property while preserving Properties ISTITO] By Corollary 
[T4I there are one or more standalone goal vertices. Among 
all possible paths connecting some initial and standalone 
goals using oriented edges from E{Q), we pick one among 
the shortest. This is q\. Note it is likely that q[ ^ Q, 
in which case we may assume head{q[) = head(qi) and 




(a) (b) 

Fig. 6. Possible cases for rearranging paths without affecting total length, 
a) q'^ is the lower green path, b) q'^ is the middle green path. 

tail{q\) — tail{qj) for some qi^qj G Q. There are two 
possibilities: Either E{q\) cE{qi)UE{qj) or q\ contains 
edges from some other paths. For the first case (Fig. |6}a)), 
rearranging the paths as shown in green does not change 
total path length. I.e., Properties ISlfTOl still hold. For the 
second case, we may assume that E{q\)\{E{qi) U E{qj)) 
belong to some other paths q/^ (applying similar reasoning 
used in the first case, we can always get such a (7^. via 
switching heads and tails of paths without changing the total 
path length). The switching shown in Fig. |6|b) gives us q\ 
without changing total path length. After updating Q (now 
contains q[ as an element), we apply the same procedure to 
Q\{q'i} and so on; the end result is a path set satisfying all 
desired properties. □ 



If we schedule agents using a path set Q satisfying 
properties l8lfT6] there can never be cases where two agents 
block each other, as a direct consequence of Lemma [12] 
There is still the possibility that one agent blocks another. 
The following theorem shows that such blocking can be 
minimized. 

Theorem 17 Given an instance of Problem |5] with input 
parameters (G,A,x/,xg) and let i be the largest pairwise 
distance between a member ofx[(A) and a member o/x(j(A), 

1= max dist{u,v). (6) 

yuexi{A).yexG{A) 

A time-expanded network jV' with T =n + £—l is necessary 
and sufficient for a feasible solution to Problem\3\to exist. 

Proof. 

(Necessity) We construct an instance of Problem |3] shown 
in Fig. |7] The graph G is two stars with centers connected 
by a single path; the red vertices form x/(A) and the blue 
ones xq{A). It is clear that all red vertices are of distance 
I to all blue vertices. Given this graph G, only one agent 
can go from a red vertex to the adjacent black vertex at a 
single time step. As such, it takes at least n time steps for 
the last agent at a red vertex to go to the neighboring black 
vertex. After that, it takes the last agent l~\ steps to reach 
a blue vertex. Therefore, a total of T ^ n + 1— \ time steps 
is necessary. 




Fig. 7. An instance of Problem [3] for demonstrating necessity claim of 
Theorem 1171 

(Sufficiency) Lemma [15] guarantees the existence of a 
path set satisfying Properties |8][T6l we work with such a 
path set Q and schedule it to get a path set P in the time- 
expanded network. The schedule is fairly simple: At each 
f = / — 1, 1 < / < n, we let agent a,- move along qi. The claim 
is that no collision will occur withing the scheduled path 
set P, which we prove via induction. For the base case, a\ 
starts at head{qi) at f = 0. By Property [16] no other initial 
locations can be closer to tail{q\) than head{q\). Since other 
paths start later, they cannot get in the way of ^I's schedule, 
which we denote pi. Therefore, p\ cannot collide with any 
other scheduled paths before it reaches its goal. 

For the inductive case, assume that {qi,. . . ,qi^_\} can be 
scheduled to get {pi , . . . , pi^_ \ } without collision. We need to 
show that {<7i , . . . ^qk] can be scheduled to get ... ,/?,(.} 
without collision. We use the property that tail{q\) is a 
standalone goal vertex, which means that no other qij ^ 1 
will ever pass tail{q\). That is, p\ cannot collide with any 
other path on or after the time it reaches its goal, tail{qi). We 



can effectively remove qi from the set {^i, . . . ,qk} and apply 
induction hypothesis to {q2, . . . ,qk} to see that {p2, . . . ,pii} 
contains no pairs that will collide. Adding pi back proves 
the inductive case. 

Since len{qi) < £ and no schedule delays a path by more 
than n — 1 time steps, T =n + £—l is sufficient for schedule 
the path set Q. □ 

Since £ cannot be larger than V, the number of vertices of 
G, the following corollary is immediate. 

Corollary 18 For every instance of Problem \3\ a feasible 
solution exists. 

In particular, the construction in the proof of Lemma [12] 
yields a complete (albeit not necessarily efficient) algorithm 
for Problem [3] In addition to confirming that any maximum 
flow algorithm over the time expanded network with 
T = n + £ — 1 is a also complete algorithm. Theorem [17] 
enables us to show that such algorithms are efficient. A 
combinatorial algorithm is an algorithm that only adds, 
subtracts, and compares values; no multiplications and divi- 
sions are allowed (i.e., ordered group operations versus field 
operations). An algorithm is a strongly polynomial algorithm 
if: 1. The number of operations in the arithmetic model of 
computation is bounded by a polynomial in the number of 
integers in the input instance, and 2. The space used by the 
algorithm is bounded by a polynomial in the size of the input 
[16]. 

Theorem 19 Problem \3\ is solvable using a combinatorial 
algorithm in strongly polynomial time. 

Proof. 

Since £ is the length of some shortest path on a 
connected graph G, £ = V — 1 in the worst case. 
Hence, in the worst case, T = n + V — 2. Following 
the construction algorithm given in Section [HI] the 
time-expanded network ,yK' has an underlying directed 
graph G' with 0{T) copies of the undirected graph 
G (the gadget and extra edges only introduce a small 
constant). The graph G' then has 0{V{n + V — 2)) vertices 
and 0{E{n + V — 2)) edges. Denoting the running time 
of an maximum flow algorithm as MF{n,ny,ne) in 
which n,ny,ne are the number of agents, vertices, and 
edges, respectively. Problem [3] can be solved in time 
0{MF{n,Vin + V -2),E{n + V -2))) - 0{MF{n,V^,VE)) 
(there cannot be more agents than vertices), which is 
strongly polynomial since many polynomial time maximum 
flow algorithms exist. □ 

Using the Ford-Fulkerson algorithm [11], the time com- 
plexity is 0{nVE). In practice, even better running times 
are possible. If G is a planar graph, we have E ^ 
Oiy) and £^ OiV^). The time complexity then becomes 

c>(MF(«,y(« + yi -2),y(« + y3 -2))) - o(mf(«, y(n + 

y2),y(n + y2))). since in our case n <V, Ford-FuUcerson 
gives us the running time (9(«y(« + y5)) — 0{n^V + nVi). 



V. Optimal Solutions 
In this section, we present optimal solutions for the 
permutation invariant multi-agent path planning problem. 
After introducing several temporal and spatial objectives 
of practical importance, we apply techniques from network 
flow to obtain optimal solutions for two of these objectives. 
Since these objectives are different from the basic version of 
Problem [3] we provide updated bounds on T , the number of 
times steps sufficient for a solution to exist. Lastly, we show 
that these objectives possess a Pareto optimal structure and 
they cannot be optimized simultaneously. 

A. Optimizing over the Feasible Solutions 

Having found feasible solutions to Problem |3] we turn 
the focus to the optimality of these solutions for practical 
purposes. As mentioned in Section |II] we intend to use the 
formulation as a model for scenarios such as multi-robot 
servicing. For many applications, time optimality is a top 
priority. Optimizing over the feasible solutions to Problem [3] 
(that is, we require that all goals are reached), there are two 
natural criteria for measuring time optimality: 

Objective 20 Minimizing the average time it takes all agents 
to reach their goals. 

Objective 21 Minimizing the time it takes for the last agent 
to reach its goal. 

In terms of agents (robots or people) serving requests. 
Objective |20] seeks to minimize the average time before a 
request gets served, which is arguably the most efficient 
approach. The time steps sufficient for an optimal solution 
to exist for this objective is (« — l)(n — 2)/2 + V. 

Theorem 22 There exists an optimal solution for Objective 
\20\ in a time-expanded network with r = («— l)(n — 2)/2 + 
V. 

Proof. Let P = {/?i,...,p„} be an solution path set with 
minimum total arrival time. The total completion time for P 
cannot be less than ^"=1 len{pj), since that is the shortest 
possible distance to travel (more time may be needed since a 
robot may stay idle at an intermediate vertex). If we leave any 
path pk out, the total arrival time it takes for all other pi's to 
finish cannot be less than ^"=1 len{pi) — len{pi). For any pi^, 
len{pii) <V —1, because interchangeability of robots means 
that it is never necessary for any robot to revisit a vertex. 
On the other hand, if we start with a shortest unscheduled 
path set Q — {^i,...,^„} and schedule them according to 
Schedule ??, the total finish time cannot be more than 

1=1 (=1 
This provides an upper bound on the minimum total arrival 
time. By the minimality of path set Q, we have (the fact that 
y — 1 — len{pii) > is used) 

n n 

J2len{p,)-len{pk)+V-l>J2len{qi)- (8) 

i=l i=l 



Since the total time it takes for paths other than pii to finish 
is at least ^1^^len{pi) — len{pii), the time it takes for any 
path pii e P to finish cannot be longer than 

V/en(^,)H {y^len{pi)-len{pk)) 

i=i ^ i=i (9) 



Unfortunately, existing results on network flow do not 
seem to translate into a polynomial time algorithm for 
optimizing Objective |20] The second objective, minimizing 
the time that its last goal is reached, provides a lower bound 
on the time that is required to reach all goals. Solutions 
optimizing this objective are useful in providing worst ser- 
vicing time estimate or guarantee. Solutions to the quickest 
transshipment problem [19] yield optimal solutions to this 
objective. However, we can avoid using submodular function 
optimization routines if we have a polynomial bound on T, 
which is provided in the following corollary of Theorem [17] 

Corollary 23 There exists an optimal solution for Objective 
\21\ in a time-expanded network with T — n + i ^ I. 

To see that Corollary |23] is true, note that T = n + £— 1 
is sufficient for finding a feasible solution, which must have 
completion time as large as that of a solution to Objective 
\2l\ With the bound on T, running log T rounds (via binary 
search) of maximum flow over time-expanded network with 
different time horizon then gives us an optimal solution 
to Objective |2T] The running time is then bounded by 
0{MF{n,V^ ,VE)\ogV), which is strongly polynomial. In 
particular, with Ford-FuUcerson, the running time becomes 
0{nVElogV). 

After time optimality, another very useful solution prop- 
erty to optimize is the total distance traveled by the agents, 
i.e., spatial optimality: 

Objective 24 Minimizing the total distance traveled by the 
agents on G. 

Because we work with a CUG, if an agent a, actually 
moves along path pi between time steps t and t + l, pi{t) 
must be different from /?,(f + l). These correspond to the 
black edges in the time-expanded network (see Fig. [3jb)). 
Thus, to optimize this objective, we can find the shortest 
total distance traveled by all agents via setting the cost of 
the holdover edge (green edges in Fig. IHb)) to zero and 
then running minimum cost maximum flow algorithm over 
the time-expanded network. The method is again strongly 
polynomial, with complexity 0{V^E\ogV), due to the fol- 
lowing corollary. 

Corollary 25 There exists an optimal solution for Objective 
\24\ in a time-expanded network with T = n-\-£—l. 



Proof. Using the scheduling method for estabhshing 
the sufficiency condition of Theorem [17] on a path set Q 
satisfying Properties l8]fT6] yields the bound. □ 

B. Pareto Optimality Between the Objectives 

From the discussion in the previous subsection, we observe 
that each of the three objectives is of practical importance. 
At this point, one might be tempted to seek solutions that 
optimize multiples of these objectives simultaneously. We 
show that this is not possible for each pair of these objectives. 
In the following theorem, we say that two objectives are com- 
patible if and only if they can be optimized simultaneously. 
Otherwise, we say the objectives are incompatible. 

Theorem 26 Over the feasible solutions to Problem \3\ Ob- 
jectives \20i24\ are pairwise incompatible. 

Proof. 

For each pair of objectives from the three objectives, we 
provide an instance of Problem |3] with which we demon- 
strate the existence of Pareto optimal solutions. 

First we look at Objectives |20] and |2T| For this pair we 
use G from Fig. [HJa) and for the rest of this proof, the red 
vertices are the initial locations and the blue vertices the goal 
locations. The optimal solution for Objective |20] is for the 
agents to take the solid paths connecting the sources and 
sinks, which has a value of ^^^4'^' ~ \. These paths yield 
a value of 3 for Objective |2T| since the longest path has a 
length 3. We may write these two values as a vector (5,3). 
If we optimize for Objective 1211 then the dashed paths give a 
value 2 and these paths yield an average time of ^+^+'^+'^ — 2. 
In vector form, this is (2,2). Hence, Objectives 1201 and 1211 
are incompatible. 




(a) (b) 

Fig. 8. Flow networks used in tlie proof of Tlieorem l26l 



To show that Objectives |20] and |24] are incompatible, we 
use the graph from Fig. ISb), which can be embeddable in 
a three dimensional grid (a 2D version can be constructed 
as well, but that will require more vertices and edges). For 
Objective |20l the optimal solution is letting the agent on the 
rightmost red vertex take the path given by the dashed edges 
and letting the rest of the agents move through the green edge 
successively, which will in turn require some agents to wait 
a little. This gives an average time of ^^^4^^^ = 6.25. These 
paths yield a total travel distance of 3 + 3 + 3 + 5 = 14. If we 
optimize over Objectivel24l then all agents should go through 
the green edge. This gives an average time of ^^^4^^^ = 6.5 
and a total travel distance of 3 + 3 + 3 + 3= 12. In vector 



form, we may write these results as (6.25, 14) and (6.5, 12). 
Clearly, Objectives |20] and |24] are incompatible. 

Finally, for Objectives ISTI and l24l we may reuse the graph 
from Fig. Ufa). It is not hard to check that the vectors we 
obtain for this case should be (2,8) and (3,6), which shows 
that the objectives are incompatible. □ 

VI. Permutation Invariant Multi- agent Path 
Planning Problem on CUGs with Goal 
Replacement 

After a thorough discussion of Problem |3] we shift our 
attention to Problem]?] which allows multiple agents to reach 
the same goal. An algorithm for finding a feasible solution 
to this problem is a trivial modification to the algorithm 
for solving Problem |3] During the construction of the time- 
expanded network ,yV' , instead of letting the edges (v', v")'s 
for a goal vertex v have unit capacity, we let them have 
infinite capacities, which allows the reuse of a goal vertex 
of G at multiple time steps. An identical version of Corollary 
Q can be stated and proved with the same proof. Similarly, 
Theorem [TtI continues to apply. That is, a feasible solution 
to Problem |4] can be found in strongly polynomial time. 

Given feasible solutions to Problem |4] we may again ask 
optimality questions about these solutions. Objectives |20l - 
|24| from Section |V] still make sense here. Since Problem |3] 
and |4] are quite similar, it is not surprising (as readers can 
easily verify) that the same techniques used to optimize these 
objectives for Problem |3] still apply. For Problem^] however, 
another objective on time optimality is also useful in practice. 

Objective 27 Maximizing the number of agents arriving at 
each time step f = 0, 1 , . . . , T, with smaller t 's having higher 
priorities. 

In the domain of network flow, Objective|27]is often called 
earliest arrival flows. It is applicable to time critical scenar- 
ios, such as evacuations due to emergencies (fire, diseases, 
etc.), in which the environment may become hazardous at any 
given time and render further evacuation efforts impossible. 
We did not include this objective in earlier discussions 
on optimality since the objective is locally greedy and an 
optimal solution may not be a feasible solution to Problem 
|3] One simple example is illustrated in Fig. |9ja). Because of 
the local greedy selection, S2 gets paired up with s^ , which 
makes goal Sj" unreachable under the formulation of Problem 
|3] Also, in case where all goals are reached. Objective 
I27] may cause later goals to be reached in arbitrarily long 
time and after traveling arbitrary far Fig. 0b) provides an 
example. Here, local greedy selection forces s'f^y to be paired 
up with sj for 1 </<«—!, which in turn forces s^ to be 
paired up with s^ . Thus, one agent must go through the 
upper (dashed) path, which can be made arbitrarily long. 

However, when goal replacement is allowed, since each 
goal can be used arbitrarily many times, problem instances 
such as these from Fig. |9] are no longer problematic. This 
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Fig. 9. a) An instance of Problem |3] for which an optimal solution to 
Objective 1271 cannot get to both goals, b) An instance of Problem |3] for 
which an optimal solution to Objective 1271 requires arbitrary long time and 
distance to reach all goals. 

makes Objective [27] a good optimization criterion. The 
objective can be optimized by running minimum cost max 
flow over the time-expanded network. More interestingly, 
with goal replacement allowed, Obiectives |20ll2n and|27lcan 
be optimized simultaneously [20]. In particular, optimizing 
Objective |22] guarantees that the other two objectives are 
also optimized. We may use any optimization method for 
Objective |22] to optimize all three objectives. The shortest 
total distance objective. Objective |24] however, cannot be 
achieved with the other three; the proof of Theorem |26] 
showing that Objective |20] and |24] are incompatible still 
holds. 

VII. Conclusion, Future Work, and Open 
Problems 

In this paper, we established the close link between two 
classes of problems: Multi-agent path planning on CUGs and 
network flow. Focusing on the permutation invariant versions 
of the multi-agent path planning problem, we proved a tight 
bound on the number of time steps necessary and sufficient 
for a feasible path set to exist in the time-expanded network, 
enabling efficient algorithmic solutions to these problems. 
We then explored optimality issues, demonstrating that the 
time-expansion bound generally carry over to yield strongly 
polynomial algorithms for optimizing two practical objective 
functions. Interestingly, each pair of these objectives cannot 
be optimized simultaneously. 

Given our study, an immediate question or criticism is 
the applicability of the results to problems beyond CUGs. 
After all, real agents, whether robots or people, do not always 
live on a discrete graph. To answer this question, we have 
research under way that explores the idea of overlaying the 
CUGs on the actual workspace. That is, we may first create 
a roadmap over the workspace that captures the connectivity 
and then discretize the roadmap over which the statement 
of Observation |2] continues to hold (as long as the edges 
are close to unit length the angle between two edges is 
obtuse, similar version of Observation |2] can be stated) [35]. 
A basic solution (Fig. [Tol a)) may be to put a grid on the 
roadmap and delete vertices inside or close to obstacles. To 
overcome the issue of the inherited Manhattan metric of 
grids, we may adapt the grid to align with the geodesies 
of the environment. For example, for a two dimensional 
workspace with polygonal obstacles, we can arrange the 
grid edges to follow edges of the visibility graph [29] of 
the environment when possible. When clearance is tight, we 
may start with a maximum clearance roadmap [35] and add 



the vertices carefully (see Fig. [TO} b)). Note that because the 
workspace is often two dimensional, these preparations can 
be computed relatively efficiently. 




Fig. 10. a) Overlaying grid on a workspace with obstacles, b) Adapting 
a roadmap to obtain a graph that can be used with our multi-agent path 
planning algorithms. 

Many interesting open problems remain. Although finding 
a distance optimal solution to Problem [T] using a time- 
expanded network is impractical due to its intrinsic hardness, 
the network flow approach might still produce efficient 
methods that yield basic feasible solutions since the time- 
expanded network has a forward only structure. In addition, 
approximation algorithms on integer multi-commodity flow 
could lead to better heuristics for optimal solution search. 
Along this line, we only touched the most essential results 
in the field of network flow, which are but the tips of an 
iceberg. It would not be surprising that results from the vast 
amount of network flow literature could be readily carried 
over to tackle path planning problems, as we proposed in this 
paper or in some other forms. As an example, for Problem 
|3] since Objectives l20ll24l are all of practical concerns but 
incompatible, it is desirable to seek solutions that provide 
performance guarantees on each of these objectives. Network 
flow methods, closely relate to linear programming, appear 
to be promising tools for such parametric optimization tasks. 
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